4.3 P-Values, Confidence Regions

1 P-Values

Suppose ϕ(X) rejects for large values of T(X). We can informally define p-value as the "under null hypothesis, probability that T(X) is as large or larger than what we observed". I.e. p(x)=PH0(T(X)T(x))=supθΘ0Pθ(T(X)T(x)).

Now we give a formal definition:

P-Value

Given P,Θ0,Θ. Assume we have a test ϕα for each significance level supθΘ0Eθϕα(X)α. (For non-randomized case, it's ϕα=1{xRα})
Assume tests are monotone in α: if α1α2, then ϕα1(x)ϕα2(x). (For non-randomized case, it's Rα1Rα2)
Then p-value is p(x)=sup{α|ϕα(x)<1}(=sup{α:xRα}).

ϕ here measures how "extreme" an observed T(X) is.

For θΘ0, Pθ(p(x)α)=Pθ(sup{α~|ϕα~(x)<1}α)limε0+Pθ(ϕα+ε(x)=1)limε0+(α+ε)=α.
So p-value stochastically dominates u[0,1].
If ϕα rejects for large T(X), reduces to original definition.

Note the p-value is defined based on

2 Confidence Sets

2.1 Definition

Confidence Set

C(X) is a 1α confidence set for g(θ) if Pθ(C(X)g(θ)1α),θΘ.
We say C(X) covers g(θ) if C(X)g(θ).
Pθ(C(X)g(θ)) is coverage probability.
infθPθ(C(X)g(θ)) is confidence level.

2.2 Duality of Testing & Confidence Sets

Suppose we have a level- α test ϕ(x;a) of (2.1)H0:g(θ)=a vs H1:g(θ)a,ag(Θ).
We can use it to make a confidence set for g(θ):
Let C(X)={a|ϕ(x;a)<1} (all non-rejected values of θ). Then Pθ(C(X)g(θ))=Pθ(ϕ(x;g(θ))=1)α,θ.
Alternatively, suppose C(X) is a 1α confidence set for g(θ). We can use C to construct a test ϕ(X) of (2.1): let ϕ(X)=1{aC(X)}. For θ:g(θ)=a, Eθϕ(X)=Pθ(C(X)g(θ))α. This is called inverting the test.

2.3 Confidence Interval for Median

For nonparametric model X1,,Xni.i.dF, (F is any c.d.f) Define g(F)=median(F)=F1(12). Consider two-sided test H0:g(F)=μF(μ)=12 vs H1:g(F)μF(μ)12.
Denote S(X;μ)=#{Xi>μ}Binomial(n,1F(μ))=H012. Reject for T(X;μ)=|S(X;μ)n2|>cα. Then μC(X)|S(X;μ)n2|cα#{Xi>μ}[n2cα,n2+cα]μ[X(n2cα),X(n2+cα)].

3 Confidence Intervals/Bounds

If C(X)=[C1(X),C2(X)], we say C(X) is a confidence interval (CI).

We usually get LCB/UCB by inverting a one-sided test in appropriate direction called uniformly most accurate (UMA) if test UMP. And get CI by inverting a two-sided test called UMAU if test is UMPU.